Mispronunciation detection without nonnative training data
Abstract
Conventional mispronunciation detection systems that can provide corrective feedback typically require a set of common error patterns known beforehand, obtained either by consulting experts or from a human-annotated nonnative corpus. In this paper, we propose a mispronunciation detection framework that does not rely on nonnative training data. We first discover an individual learner’s possible pronunciation error patterns by analyzing the acoustic similarities across their utterances. With the discovered error candidates, we iteratively compute forced alignments and decode learner-specific context-dependent error patterns in a greedy manner. We evaluate the framework on a Chinese University of Hong Kong (CUHK) corpus containing both Cantonese and Mandarin speakers reading English. Experimental results show that the proposed framework detects mispronunciations effectively and also prioritizes feedback well.
Similar resources
An Application of Modified Confusion Network for Improving Mispronunciation Detection in Computer-aided Mandarin Pronunciation Training
In this paper, we propose an application of confusion network for Mandarin mispronunciation detection. Compared to former published works, which are proven to work effectively and robustly in detecting mispronunciation in word level and only successfully detect mispronunciation in sentence level in strictly small constrained search space, our modified confusion network based Computer-aided Pron...
Automatic detection of phone-level mispronunciation for language learning
We are interested in automatically detecting specific phone segments that have been mispronounced by a nonnative student of a foreign language. The phone-level information allows a language instruction system to provide the student with feedback about specific pronunciation mistakes. Two approaches were evaluated; in the first approach, log-posterior probability-based scores [1] are computed fo...
Maximum F1-Score Discriminative Training for Automatic Mispronunciation Detection in Computer-Assisted Language Learning
In this paper, we propose and evaluate a novel discriminative training criterion for hidden Markov model (HMM) based automatic mispronunciation detection in computer-assisted pronunciation training. The objective function is formulated as a smooth form of the F1-score on the annotated non-native speech database. The objective function maximization is achieved by using extended Baum Welch form l...
Vowel mispronunciation detection using DNN acoustic models with cross-lingual training
We address the automatic detection of phone-level mispronunciation for feedback in a computer-aided language learning task where the target language data (Indian English) is limited. Based on the recent success of DNN acoustic models on limited resource recognition tasks, we compare different methods of utilizing the limited target language data in the training of acoustic models that are initi...
Mispronunciation Detection Leveraging Maximum Performance Criterion Training of Acoustic Models and Decision Functions
Mispronunciation detection is part and parcel of a computer assisted pronunciation training (CAPT) system, facilitating second-language (L2) learners to pinpoint erroneous pronunciations in a given utterance so as to improve their spoken proficiency. This paper presents a continuation of such a general line of research and the major contributions are twofold. First, we present an effective trai...